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ABSTRACT 

The highly nonlinear regime of gravitational clustering is characterized by the presence 
of scale-invariance in the hierarchy of many-body correlation functions. Although the 
exact nature of this correlation hierarchy can only be obtained by solving the full set of 
BBGKY equations, useful insights can be obtained by investigating the consequences 
of a generic scaling ansatz. Extending earlier studies by Bernardeau & Schaeffer (1992) 
we calculate the detailed consequences of such scaling for the implied behaviour of 
a number of statistical descriptors, including some new ones, developed to provide 
useful diagnostics of scale-invariance. We generalise the two-point cumulant correlators 
(now familiar to the literature) to a hierarchy of multi-point cumulant correlators 
(MCC) and introduce the concept of reduced cumulant correlators (RCC) and their 
related generating functions. The description of these quantities in diagrammatic form 
is particularly attractive. We show that every new vertex of the tree representation of 
higher-order correlations has its own reduced cumulant associated with it and, in the 
limit of large separations, MCCs of arbitrary order can be expressed in terms of RCCs 
of the same and lower order. Generating functions for these RCCs are related to the 
generating functions of the underlying tree vertices for matter distribution. Relating 
the generating functions of RCCs with the statistics of collapsed objects suggests a 
scaling ansatz of a very general form for the many-body correlation functions which, 
in turn, induces a similar hierarchy for the correlation functions of overdense regions. 
In this vein, we compute the lower-order Sn parameters and two-point cumulant 
correlators Cnm for overdense regions and study how they vary as a function of the 
initial power spectrum of primordial density fluctuations. We also show that these 
results match those obtained by the extended Press-Schechter formalism in the limit 
of large mass. 

Key words: Cosmology: theory - large-scale structure of the Universe - Methods: 
statistical 



1 INTRODUCTION 



Gravity is scale-free. In the absence of any externally-imposed length scale, such as might be set by initial conditions, it is 
therefore reasonable to assume that gravitational clustering of particles in the universe should evolve towards a scale-invariant 
form, at least on small scales where gravitational effects presumably dominate over initial conditions. Observations offer 
support for such an idea, in that the observed two-point correlation function £(x) of galaxies is reasonably well represented 
by a power law over quite a large range of length scales, 



(Groth & Peebles 1977; Davis & Peebles 1983) for, say, r between 100/i -1 kpc and 10/i -1 Mpc. Higher-order correlation 
functions of galaxies also appear to satisfy a scale-invariant form, with £jv oc as expected from the application of a 
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general scaling ansatz (Groth & Peebles 1977; Fry & Peebles 1978; Davis & Peebles 1983; Szapudi et al. 1992). For example, 
the observed three point function, £3, is well-established to have a hierarchical form 

£,:i( x a, Xfc, X c ) = ^atc = Q{£,ab^bc + £,ac^ab + ^ac^bc), (2) 

where £ ai) = £(x a ,x 6 ), etc. In a similar spirit, the four-point correlation function can be expressed as a combination of 
four-point graphs with two different topologies - "snake" and "star" - with corresponding amplitudes R a and Rb respectively. 

£ 4 (x a ,x 6 ,x c ,x d ) = £ abcd = R a (€ab£bcticd + ■•■ (12 terms)) + Rb(£ab£ac£ad + . . . (4 terms)) (3) 

(Fry & Peebles 1978, Fry 1984a, Bonometto et al. 1990, Valdarini & Borgani 1991). It is natural on the basis of these 
considerations to infer that all N-point correlation functions can be expressed as a sum over all possible N-tree graphs with 
(in general) different amplitudes for each tree diagram. It is also generally assumed that there is no shape-dependence in these 
tree amplitudes in the highly non-linear regime: 

(Jv-i) 

&v(ri, . . . rjv) = 

Q ( N- trees labellings edges 

Some numerical experiments have confirmed the validity of such an assumption, at least as far as three-point correlation 
function is concerned (Scoccimarro et al. 1998). 

This tree-level model of hierarchical clustering however is a particular case of a more general scaling ansatz proposed by 
Balian & Schaeffcr (1988, 1989), in which the iV-point correlation functions can be written in the form 

^ N (Xr 1 ,...Xv N )=X-^ N - 1) ^ N (v 1 ,...r N ). (5) 

Rigorous theoretical motivation for these assumptions are, however, yet to be forthcoming. Considerable progress has been 
made recently in understanding the evolution of gravitational clustering in limit of weak clustering, i.e. in the quasi-linear 
regime (e.g. Sahni & Coles 1995). In particular, it has been shown that, in the quasi-linear regime, a tree-level hierarchy of 
correlation functions develops which satisfies |jv oc at the limit of vanishing variance. On the other hand, the intermediate 

regime, where loop corrections to tree-level diagrams become important, and the highly non-linear regime, where perturbative 
calculations break down completely, remain poorly understood and answers to many questions are still unclear. 

Theoretical studies of the evolution of correlation functions in the highly nonlinear regime have mainly concentrated 
on possible closure schemes for the Born-Bogoliubov-Green-Kirkwood-Yvon (BBGKY) equations (Davis & Peebles 1977), 
stability properties of these closure schemes (Ruamsuwan & Fry 1992) and different phase-space separation schemes for 
hierarchical solutions of arbitrary order (Fry 1982; Hamilton 1988). Although a general solution of the BBGKY equations in 
nonlinear regime is still lacking, various interesting characteristics of the count probability distribution function (CPDF) and 
void probability distribution function (VPF) have been established by Balian & Schaeffer(1992). These predictions have also 
been tested against numerical simulations, in both two and three dimensions, and were found to be in good agreement. 

However, most of the analytical results available so far relate to quantities associated only with one-point cumulants. 
Examples are the so-called Sn parameters, as well as the CPDF and VPF mentioned above. Multi-point cumulant correlators 
(MCC), which we shall introduce in this paper, are a natural generalization of the Sn parameters and the two-point cumulant 
correlators (2CC) which are generally used to analyze galaxy catalogs and compare observational data with different models of 
structure formation. Cumulant correlators were proposed recently by Szapudi & Szalay (1997) and shown to be very efficient 
for extracting clustering information from galaxy surveys. In particular, they can be used to test gravitational instability 
scenario and constrain the initial power spectral index. Analytical predictions regarding the 2CCs were made by Bernardeau 
(1995) using tree-level perturbation theory in the limit of large separation. In particular, he showed how the bias of overdense 
cells with respect to the underlying mass distribution is related to the generating function of the 2CCs. These predictions 
were also successfully tested against numerical simulations. Such calculations were also extended for application to projected 
(angular) catalogues and have now been shown to be in reasonable agreement with estimates of these quantities extracted 
from the APM survey (Munshi, Melott & Coles 1998). 

Bernardeau & Schaeffer (1992) developed a systematic series expansion for the multi-point void probability distribution 
function (MVPF) which is essentially the generating function of the multi-point count probability distribution function 
(MCPDF). This expansion again applies only in the highly nonlinear regime. Using such a series expansion, based on powers 
of £ij /£j where £» is the variance in the ith cell, they were able to evaluate the bias associated with overdense regions. The 
bias thus calculated, which was analytically shown to depend only on a scaling variable associated with the collapsed object 
and an intrinsic property of the object, was later found to be in very good agreement with numerical calculations. It is clearly 
an interesting task to extend such analysis further and check predictions against numerical simulations. 

If one can believe that galaxies are in some way identified with overdensities in the matter distribution then observed 
estimates of the two-point cumulant correlators from a galaxy survey depend upon the bias of such regions relative to the 
underlying mass distribution. Multipoint cumulant correlators allow more information to be deduced about the properties of 
bias, because the quantities related to the Sn parameters of overdense cells. Complete knowledge of all Sn parameters (which 
are related to the generating function of MCCs to arbitrary order) can also help us in constructing the count probability 
distribution, and related void probability distribution, for collapsed objects. 

Unfortunately, there is at present no robust theoretical framework other than the hierarchical ansatz within which such 
objects can be studied. One-point statistics of collapsed objects have been studied by Mo & White (1996) using the extended 



Press-Schechter formalism (Press & Schechter 1974; Bond et al. 1991; Bower 1991; Lacey & Cole 1993; Kauffmann & White 
1993) but it remains to be seen whether this formulation can be used to derive multi-point statistics for collapsed objects. The 
same is largely true of "peaks theory" which is based on an argument used by Kaiser (1984) to explain the strong clustering 
of Abell clusters using the properties of high-density regions in Gaussian fields. This formalism was subsequently developed 
by Bardeen et al.(1986) and its use in cosmology is now widespread. However, there is evidence that correspondence between 
dark haloes and density peaks is not particularly good (Frenk et al. 1998; Katz, Quinn & Gelb 1993) and one needs to 
take the gravitationally-induced motions of peaks into account to compute the final clustering properties of halos. Progress 
has recently been made in overcoming these difficulties by Bond & Myers (1996a, b) in the "Peak-Patch Formalism" at the 
expense of considerable technical complexity. Other efforts to study statistics of collapsed objects include use of the Zel'dovich 
approximation in nonlinear regime after suitably smoothing the initial potential and using the statistics of singularities of 
different types (Catelan et al. 1998; Lee & Shandarin 1997, 1998). 

It is also important to note that earlier derivation of bias and S3 for collapsed objects in the context of hierarchical ansatz 
were done within the context of certain specific approximations and it is interesting to check their validity by extending such 
studies to higher orders, as they are widely used for error estimations of measurements from galaxy catalogs (Szapudi & 
Colombi 1996), as well as the derivation of mass function of collapsed objects (Valageas & Schaeffer 1997). 

In a completely different context it was shown by Colombi et al. (1996) that normalized one point cumulants or Sn 
parameters for underlying mass distribution do not become constant in highly non-linear regime, in direct contrast to hierarchal 
ansatz. Which is also connected to the question, if gravitational clustering at highly non-linear regime do satisfy the stable 
clustering ansatz (Peebles, 1980). Connection between stable clustering and hierarchical ansatz still remains unclear and it 
is also related to the question, if different initial conditions do lead to similar halo profiles. In a recent study, using stability 
analysis of nonlinear clustering Yano & Gouda (1998) showed that stable clustering may not be the most natural outcome of 
gravitational clustering and departure from stable clustering ansatz may mean departure from hierarchical ansatz. However 
their results depend on specific closure schemes. In any case such issues can only be tested if different prediction regarding 
hierarchical ansatz are derived analytically and tested against numerical simulations to find out their limitations, which is 
one of the motivation behind present study. 

The paper is organized as follows. In next section we develop a diagrammatic method to represent cumulant correlators 
of arbitrary order. Using these rules it is also possible to compute these quantities to arbitrary order. In section §3 we use the 
method of Bernardeau & Schaeffer (1992) to compute generating functions of RCCs to 5 th order. In section §4 we use results 
of section §3 to compute Sn parameters for collapsed objects. We discuss these results in the last section, and place them in 
the context of other developments in this field. 



2 MULTI-POINT CUMULANT CORRELATORS AND THEIR GENERATING FUNCTIONS 

Higher-order reduced correlation functions are typically dominated by contributions from high peaks of the density fields. It 
is reasonable, therefore, to infer that the generating functions of MCCs are related in some way to the statistics of collapsed 
objects. According to the hierarchical ansatz, those tree-level diagrams which contribute to reduced correlation function of 
the form (5(x a ) p . . . 8{xh,) w ) are of order p + . . . + w. MCCs can be constructed by identifying p position arguments as x n , 
q arguments as x;,, and so on, with the x a representing the positions of "cells" defined by some implicit smoothing of the 
density field. Not all contributions constructed in this way are of same order of magnitude. The dominant contribution will 
always come from those terms which have the minimum number of external lines joining different cells, since every extra line 
joining different cells contributes an extra factor of « 1 to the overall amplitude of the diagram. 

With such a construction, the complete tree diagram can be separated in two parts: (i) external lines joining different 
cells; and (ii) internal lines joining the different vertices in the same cell in such a way as to keep the external tree topology 
unchanged. The number of internal degrees of freedom of the diagram corresponds to the number of different ways the 
internal vertices in a particular node can be arranged without altering the external connectivity of the tree. The external 
tree configuration connecting different nodes representing different cells on the other hand is very similar to the underlying 
tree structure of matter correlation function and depends only on the number of nodes in the tree. The number of different 
topologies for external tree increases with the number of nodes. 

The difference between the external tree structure connecting different nodes and the underlying tree structure of the 
matter correlations connecting different vertices is that nodes in the tree representation of MCCs carry an internal degree of 
freedom due to the internal tree structure and represent an entire cell, whereas for the underlying matter distribution, each 
vertex represents a single point in space and does not carry any internal structure. If we denote by C p \ the number ways p 
internal points can be arranged with one external leg remaining to be connected with another cell we can express the 2CCs 
as 

<<s p (x a )5 9 ( Xi) )> = [c pl i ab c ql \e +q - 2 = c pq ue +q ~ 2 (6) 

Similarly, we can define C p n as an internal degree of freedom for a node with p internal vertices and 2 external legs. The 
3CCs now can be expressed in terms C p n and C p i. In practice C p n can be described as number of different possible ways in 
which p internal vertices can be rearranged keeping two external legs free: 

(<5 P (x )(5 9 (Xf,)<5 r (x c )) = [Cql^abCpll^acC'rl + Cpl^abCqll^bcCrl + C p l^acC'rll^bcC p lj^ P + q+r 3 = C pqr ^abc^ P + q+T 3 (7) 



A similar decomposition of ACCs and 5CCs can be used to define C p m and C p im: 
{5 p (x. a )S q (x b )8 r (x c )8 a (x d )} = [CqxtabCpntadCsxtacG'r! + (cyclic permutations) 

+ C q i^abCpu^acCrii^cdCsi + (cyclic permutations)] £P+i+ r+s - 4 

— r £ , FP+q+r+s-4. /o\ 

— ^- / P qrsC,abcd£, 5 \o ) 

and 

(<5 p (x a )<5 9 (x 6 )<f (x c )(5 s (x d )5*(x e )) = [C pllll (, ab C ql £ ac C rl (, ae C t i + (cyclic permutations) 

+ Cpin^adCqii^bcCri^adCtiiaeCsi + (cyclic permutations) 

+ Cpi^abCqii^bcCriiicdCsiiideCti + (cyclic permutations)] £i>+<?+ r + s + t 

_ (~< t fp+g + r + s + i-5 /qn 

It is also possible to consider this set of new parameters C p i...i describing the internal degrees of freedom as a set of 
reduced cumulant correlators (RCCs) which can be used to decompose MCCs C p ... v . The number of subscript Is that appear 
in the RCCs C p i...i thus defined represents the number of external trees and p represents the number of internal vertices. The 
total contribution from a cell is equal to product of all vertices in the internal tree structure and of the number of different 
ways one can rearrange the trees keeping the number of internal vertices and external legs fixed. The highest order vertex 
that appears in an internal tree of C p i is p, in C p n it is p + 1 and so on. For the purposes of counting different possible 
arrangements, the external hands are to be considered distinguishable, as they link different external nodes and hence represent 
one particular topology of the external tree. Internal lines joining different vertices carry an weight of £; where, as above, £; 
is the variance in the ith cell. External lines carry an weight of where £ij represents the magnitude of the correlation 
function joining ith and jth cells. Using these rules one can evaluate RCCs to any order. We list here the contributions from 
different topologies at lower orders: 

Cn = 1, C21 = 2v2,C$ = 6k| , C£> = 3u 3 

Cm = V2, C&\ = 2u :i , Cfl = 2ul C&> = 3^4, = 6^2, C&i = 12i*i*, cgl = 61/f , 
C£\ = 4vs, Cjft = 24i*i/f, c| n = 24z/f , = 24*^2, = 24^, c[{\ = 96wl 
Cil\ = 24*^2, C&\ = 12ul C|i\ = 24^ 

Cim = V3,C^ U = 2f 4 ,C^ ) 11 = &V2V:i 

Cam = 6^4, = 18^4, C^ n = \%v 3 vl, C<? n = 18^1, = 3z/ B , = 18^ 2 , etc. (10) 

The superscripts a, b, etc. denote different configurations of internal vertices with fixed external legs and the coefficients in 
each term represent the number number of possible internal rearrangements. We denote the amplitudes associated with n th 
order vertex of matter correlation function by v n . 

The sum of all internal vertices appearing in the tree-representation of MCCs (<5(x a ) p . . . 5{-x.h) w ) will satisfy the following 
equation, which guarantees that only the leading-order diagrams with the minimum numbers of external legs are considered: 

p 1 

5^i + 5^j + ... = 2(p + « + ...)-2 (11) 

i i 

This means that, for one point cumulants, ^ P i = 2p — 2 and for 2CCs ^ P i + ^ 9 j = 2(p + q) — 2. These are the same as 
the conditions met by the leading-order tree- level contributions of cumulants and cumulant correlators (Bernardeau 1995). 
This can also be understood by noticing, for example, that the expression for the 3CC contains factors £a _1 £{| _1 ££ _1 f° r three 
individual cells and two correlation functions connecting three different cells i.e. £ a b£,bc (or any cyclic permutation), so finally 
we get the order to be equal to 2{p + q + r — 3) + 4 = 2(p + q + r) — 2. 

For reasons we shall discuss in the next section, we now define generating function of the RCCs by following equations: 

°° „ 2 3 

n=l 

00 s~i 2 3 

/*»(») = Xl^r 2/TO = _!/22/ + (2l ' 22+2!/3) 2r _(6 ^ + 18l/2i/3+3! ' 4) lr 

71 = 1 

4 

+(4!/ 5 + 48*^4 + 36^ 3 2 + 144i/2 ^3 + 24z/|)|y + • • • 

°° r , 2 3 

W»(v) = V -^V = v z y - (6z/ 2 i/ 3 + 2^4)^- + (36^3 + 18*/;? + 24/^4 +3vs)^r-... (12) 

n=l 

The total number of trees appearing in the representation of the S p parameters (related to C p ) is p p ~ 2 . Similarly, for C p i, the 
total number of terms is p p_1 ; for C p u it is p v , for C p m it is p p+1 and C p i...i in general will contain pP+ r ~ 2 terms, where r 




Figure 1. Graphical representation of MCC (&{x a ) p . . . 8(xh) w ) in terms of RCCs C p i , C' p n , C p m .... It is interesting to note that 
the underlying tree structure reproduces a similar tree hierarchy for the multi-point cumulant correlators. It is also important to note 
that such a decomposition is possible if and only if the hierarchical amplitude v n becomes independent of geometrical form factors, 
which is true only in the highly non-linear regime. The contribution to tree hierarchy can be represented by different tree diagrams 
(Fry 1984), and all possible topologies are considered with fixed number of vertices. While at second order only one type of topology 
is possible, higher-order contributions generate different topologies. For example, in third order (C) one can have cither a "snake" or a 
"star" topology. 

is the number of external lines joining different cells in the tree representation of the RCCs. For one point cumulants, or Sn 
parameters, r = 0, for 2CCs r = 1, and so on. 



3 TREE REPRESENTATION OF MULTI-POINT CUMULANTS 

Let us denote the joint probability distribution function of q different cells having cell occupation of Ni , . . . , N q by 
P(Ni, . . . N q ). The generating function of this distribution is V(\i, . . . , \ q ) = exp[x(Ai, . . . , X q )]. 

exp[x(Ai,...,A,)]=P(Ai,... 1 A,)= ^ P(N U . . . , N q )X^ . . . X^" . (13) 

N lt ...,N q 

Extending the known relation for one-point statistics (Balian & Schaeffer 1989) and employing the vertex generating function 

g m = ES ( - t) "' (14) 

71=1 
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Figure 2. The tree representation for multi-point cumulants can be understood by considering the external tree which represent how 
different cells are linked with each other and the internal trees in each cells, i.e. how internal vertices are arranged in a tree structure with 
fixed number of external legs. In this figure we represent different internal trees which appear in the multi-point cumulant correlators. 



Bernardeau & Schaeffer (1992) showed that it is possible to relate exp(x(Ai, . . . , X q )) to G(r) using 



1 1 
x(Ai,...,A„) = -—( yi G(Tj) - ^VinG' '(Tjj) 



■ si 



(15) 
(16) 



The correlation of different cells enters through fij/fi in eq. ([l6|). If we ignore this term in the second equation, which governs 
the behaviour of n, then the statistics of different cells are independent of each other. This is a kind of zeroth approximation 
when only one-point statistics are used in the description. If we expand (x(Ai, . . . , A 9 )) in a series of powers of then we 

will recover the effect of correlations between different cells of progressively higher order. Expanding eq. (Jl^) we can write: 



7 .(0) +r (D + 



, /iG '( r f + T W + ,.00) = _ £ k to Gf(rf + r« + rf ) 
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(17) 



Corrections up to second order in were computed by Bernardeau & Schaeffer (1992). These and the next higher- 

order terms are as follows: 
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Certain predictions of such a series expansion have already been tested against numerical simulations, at least to linear order. 
Such predictions from the tree-level hierarchy relate to the two-point cumulant correlators (2CCs) which satisfy the relation 
(<5 p (xi)<5 IJ (x2) = [CpiC q i£,ab\£, p+q ~ 2 (Bernardeau &: Schaeffer 1992; Bernardeau 1995). This property of factorization is related 
to the fact that the bias associated with an overdense object is an intrinsic property of the object. The bias associated with a 
pair comprising two different objects can be factorised into two separate quantities which each represent the intrinsic bias of 
one constituent. In recent work Munshi & Melott (1998) were able to show that predictions such as C pq = C p iC q \ are satisfied 
very accurately in highly non-linear regime when two over dense cells are not separated by a large distance and (£ij/£i) < 1 is 
satisfied. Other related predictions regarding bias have also been tested successfully by Munshi et al. (1998). The Hierarchical 
ansatz implies natural predictions for other higher order quantities, such as one point cumulants for collapsed objects which 
we will also compute here. A comparison of such predictions against numerical simulations is left for future work. 

In a similar vein, we can consider a series expansion of x(Ai, • ■ ■ , A 9 ) in (£i 3 /£i) we get higher order correction terms that 
involve contribution from the n to the same order. In leading order it can be shown that all over-dense cells are uncorrelated. 
Each higher-order term encodes the effect of one more extra neighbouring cells on occupancy of a given single cell. 
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After a tedious but straightforward exercise in algebra it is possible to express every order of x(Ai • • • A 9 ) in terms of t-°\ It 
is interesting to note that every new power of (£ij/£i) in the expansion of x(Ai • • • A q ) adds a new "star" vertex of same order. 
What is most remarkable however is that such an expansion is also able to reproduce other diagrams such as the "snake" and 
hybrid of "stars" and "snakes" of lower orders. 
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We have reintroduced quantities jj, n that represent vertices in the tree-level diagrams of MCCs, n representing the number 
of external legs associated with such vertices. In terms involving ordered sums we can replace — ^ ' ^ ' by , ^ ' (i.e. sum 

i i^j pairs 

over all pairs) and similarly ^ by (i.e. sum over all triplets) and additional terms which contribute to loop 

i iy^j i^k triplets 

corrections to lower order terms. Similarly — can be replaced by and loop contribution to lower 

i ij^j i^k i^l quadruplets 

order terms. 

It is also pertinent to note that, at every order, the hierarchical ansatz reproduces the underlying tree structure of 
matter correlation, but with different amplitudes for the vertices. It also produces different loop corrections to lower order 
diagrams. Since we are only interested in the leading order contributions to the statistics of collapsed objects (important in 
large separation limit) we will only focus on tree level diagrams. If we however focus on statistics of the initial Gaussian field it 
is important to note that since the reduced correlation functions of odd orders are zero, one needs to consider these corrective 
terms which are neglected here as in this case they will provide the dominant contributions. 

The expression of [i n to arbitrary order can be used to construct statistics of collapsed objects, such as the probability dis- 
tribution function or the void probability function, quantities which are generally easier to extract from numerical simulations 
than the higher order moments themselves. We begin with: 



fj-i{y) = 


-yG'(r( y )) 






Mf) = 


-yG"(r(y)) 
l + yG"(r(y)) 






^i(y) = 


-yG"'(r(y)) 






(l + yG"(r(y)r 






My) = 


-yG IV {r{y)) 


3j/ 2 G"'(r(y)) 2 




[l + yG"(r(yW 


+ [1 + yG"(T(y)Y> 




Ms(y) = 


-yG V {r{y)) 


Wy 2 G IV (r(y)G'"(r(y)) 


l^G'"(r{y)f 


[l + yG"(r(yW 


[l + yG"(r(yW 


[l + yG"(r(y)Y 



(21) 

The series expansion of these functions will reproduce terms which we have earlier obtained from counting internal degrees of 
freedoms associated with different nodes in tree representation of the RCCs. One can then construct the successive approxi- 
mations to V as follows: 

7> (0 >(Ai,...,A,) = JJp(Ai) 

i 

r (o) r (°) 

v m (\i,...,x g ) = V-upMfo^Aj)-^- TT v(\k) 

(0) / \ (0) 

7>< a >(Ai,...,A,) = V ^Vimn^-ViXMik^ViXu) TT V(\i) 

(l,j,k) l^,3,k 

I \ r (°) (0) (0) 

P^(Ai,...,A 9 ) = V ^ V {\ l )^^ V (\ ] )^ k T -j-V{\k)t a T -± r V{\ l ) TT V(\m) 

(i,j,k,l) m^i,j,k,l 



(0) / \ / \ (0) 

I \ r (°) (0) (0) (0) 

■p (4) (Ai, . . . , Aq) = V ^p(A l )^^P(A 3 )^^P(A fc )^ I ^^(AO^^^(A m ) TT P(A, 



+ V ^P(A0^™ Z ^(A J )^ fc ^^(A fc )^ i ^ Z ^(A0^7^^(A m ) [] P(A r 

/ ■ - , , v Si Sj Sfc Si U , . . , . 

(i,j,k,l,m) r^i,j,k,l,m 



+ V ^ViX^^PiX^^ViX^^MviX^t^ViXrn) TT W ( 22 ) 

5» Sj Sfc Si ?m , . . , . 

The quantity P(X) plays the role of the generating function for the one-point probability distribution P(N). If one 
incorporates the "bias" for a cell of occupation number N by b(N) then the generating function for P(N)b(N) can be seen to 
be 7 3 (A)r(y)/^. Similarly, V(X)fi, s (y) /£ is the generating function for P(N)v a (N). To summarise: 

5>"P(JV) = P(A); 

V \ N P(N)b(N) = P(A)^3; 
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^A^P(iV), 2 (iV) = P(A)^H; 

VaVW.^JV) = P(A)^B. (23) 

52 

Using the generating functions we have obtained (21), we can then express the count probability distribution function to 
successive order: 

p<°>(n) = n p (^)' 

i 

P (1) (N) = ^P(iV fc ) Yl 6(iV,)P(iV,)^P(iV i )&(iV i ); 

P (2) (N) = J2 P ( JV ') II MWPiNJZijPiNiWNitekPiNHWNH); 

P (3)( N ) = ^ P (jv m ) JJ vsiN^PiNiKijPiN^biNjKikPiNkWNk^uPiNtWNt) 

m^i,j,k,l (i,j,k,l^m) 

+ J2 P(iVm) II ^(^)K^i)6^2(iV j )P(iV j )^^2(iV fc )P(iV fc )6 i P(iV i )6(iV i ); 

r^=i,j,k,l,m (i,j,k,l^m) 

P (4) (N) = ^ P(JV r ) [| ^ 4 (iV^P(iV0^P(iV j )6(iV,)^P(iV fc )6(iV fc )^P(iV06(iV06 m P(iV m )6(iV m ) 

r^i,j,k,l,m (i,j,k,l,m^r) 

+ ]T P ( jVm ) II p (^0K^0^'^(iV J )P(iV J )C Jfc P(iV fe )6(iV fe )O^2(iV0P(iV06 m P(iV m )b(iV m ) 

r^=i,j,k,l,m (i,j,k,l^m) 

+ Y P ( jV -) II ^(WW^iV^P^^^^ (24) 

r^i,j,k,l,m (i 7 j 7 k,l^m) 

where we have used the notation P'*'(N) = P^\Ni, . . . , N q ) for arbitrary number of cells g. The product terms involve 
pairs for i = 1, triplets for i = 2, quadruplets for i = 3 and pentuplets for i = 4. The functions 6(A r ) = fii(N) and /j,n(N), 
which describe the effect of correlations between different cells, will each satisfy two normalization conditions satisfied by the 
multi-point count probability distribution function (CPDF) P(Ni, . . . ,Nk). The first set of these conditions is obtained by 
taking the first order moment of P(Ni, . . . , Nk) with respect to all but one of the Nk, and the second set is obtained by taking 
moments with respect to all Nk and using the definition of the underlying matter correlation function. Hence we impose 

Y P(N)v s (N) = 0; Y NP(N)v 3 (N) = Nv s , (25) 

where un are the amplitudes associated with vertices of underlying mass distribution in the highly nonlinear regime. 

Using inverse Laplace transforms, it is now possible to relate the distributions P(N) or P(N)v s (N) in terms of their 
generating functions P(A) or in terms of fJ, s (y)- In turns out that the results can be expressed in terms of functions of a scaling 



variable x = N/N c , where N c — N£ 2 and N is the typical occupation number of cells under consideration. 

pwKN) = ^f^v W m = £ r mit X ) 



P(N)v 3 (N) = ^|^mi^ = — M^Lri (2!,) 



dX r>r.\i M3 ^ = — - 



In turn the scaling functions /i(a;) and b(a:), which describe the behaviour of P(N), b(N) and v s {N) for different length 
scales and for different levels of non-linearity, are expressed in terms of the vertex-generating functions /j, s (y) we introduced 
earlier in equation (21): 

i r°° 

= / d y ya(y)exp(yx); 
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J ioo 

i r°° 
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J ioo 

1 r°° r(y) 
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2m / y 
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2m / 

^ r-i oo 

v z {x)h{x) = -— / dy ^ 3 (y)exp(yx); 

J ioo 

2m / 

•/ IOC 

in which the further functions /i(> x), z/ s (> x), etc. are defined by 
/i(> x) = h{x)dx\ 

J X 

f-OO f-OO 

b(> x) = / h{x )b{x')dx I I h(x')dx'; 

J X J X 



v 2 {> x)h{> x) = 77— / dyt-^Lexp(yx); 

y 



■'■)/'( : - — / dy^exp(yx), (27) 



v 2 (> x) — I h{x)v 2 {x')dx I h{x)dx; 

J X J X 

poo poo 

^3(> x) = I h(x )vz(x')dx I I h(x')dx'; 

J x J X 

poo poo 

va(> x) — / h(x')v4,{x')dx I I h{x')dx . (28) 



One of the reasons for using the cumulative scaling functions (such as h(> x) or pi(> a;)) instead of h(x) and fj, s (x) is that 
one can more easily incorporate a threshold in cell occupation numbers to define the overdense objects. Cell statistics defined 
in this way are much more stable than differential count statistics and easier to test against numerical simulations, as they 
depend less on specific models for underlying mass distribution characterized by G(r). 



4 MULTI-POINT CUMULANTS AND STATISTICS OF COLLAPSED OBJECTS 

We can now discuss the statistics of overdense cells in terms of their vertex representation. Using the results of the previous 
section, we can assign a weight M s to vertices of the trees representing the distribution of overdense cells, such that with 
M s plays the same role as v 3 in the tree expansion for the underlying distribution. A complete knowledge of all the nodes 
for overdense cells (or their generating function) guarantees a full statistical description for collapsed objects, including the 



one-point cumulants S% and cumulant correlators C% M , the subscripting referring to the collapsed objects which we assume 
can be selected by applying a density threshold to the cells . In the following calculations give expressions for orders < 5. The 
quantities M s can be written 

We can write the one-point cumulants in terms of the amplitudes M a as follows, which are similar to the case of underlying 
mass distribution: 

«<>*> - ^ +I2 (S^)' 

= 4M 3 (> x) + 12M 2 (> xf 

v ' b{> x) 3 b(> x) 3 \b(> xf ) \b{> x) 2 J 

= 5M 4 (> x) +60M 3 (> x)M 2 (> x) +60M 2 (> x) 3 . (30) 
The two-point cumulants are given by 

= 2§^J = 2M 2 (> z) 
o(> a;J z 

,M3(> x) ( ni(> x)^ 



b(>x) 3 U(>x) 2 
= 3M 3 (> x) + 6M 2 (> xf 

= 4M 4 (> x) + 36M a (> ^)M 2 (> x) + 24M 2 (> a;) 3 . (31) 

In the following two subsections we obtain approximate forms for the amplitudes M a in two different limiting cases. For the 
determination of Sn and Cnm parameters for all values of mass it is necessary to evaluate contour integrals relating v 2 {x) 
and lJL 2 {y) numerically, an exercise we do not attempt here. 

4.1 The case of moderately over-dense objects (x « 1) 

The asymptotic forms of the different scaling functions in the limit x << 1 are depend upon the limiting behaviour of the 
scaling functions when y » 1. In this case, the scaling function o{y) is known to exhibit a power law profile with amplitude 
a and power-law index ui both depending on initial power spectral index (for a detail derivation see Balian & Schaeffer 1989, 
and Bernardeau & Schaeffer 1992): 

a(y) = ay-; h(x) = a^-^x^ 2 , (32) 

where F(-) is the T-function, and we use the specific form of G(r) given by Bernardeau & Schaeffer (1992): 

G(r) = (l + Ly K . (33) 

The asymptotic behaviour of G(r) can then be expressed as 

G(r) oc ct~ k , (34) 

and this can be used to compute the asymptotic form for a(y): 

i 

r (y) = (y CK ) <K+2) 

<r(y) = (l + ^)KT^c~Tzf2)yWTv (35) 
Using these expressions we can relate c and k with a and u>: 

k = (fcc) 1/(fc+2) = (2acu) 1/2 , (1 - w)a = fc—c 1 —. (36) 

(1-w) 

It is now possible to express the asymptotic form of jj, n as a function of y and in terms of parameters describing the asymptotic 
behaviour of a(N c ), i.e. a and lu: 

My) = r(y) = (2ao;) 1 /V 1 ^ )/2 
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Figure 3. Lower-order cumulants and cumulant correlators are plotted as a function of to for the limiting case of a; << 1. There are no 
analytical results connecting ui with the initial spectral index, n, but simulations suggest that a; increases with the amount of small-scale 
power. Determinations of lo for different scale free initial conditions in 2D and 3D were made by Munshi et al. (1997). Our results show 
that both cumulants and cumulant correlators for overdense regions increase with the amount of small-scale power, which reflects similar 
a behaviour for matter cumulants and cumulant correlators. 
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Using these asymptotic forms and the definitions of h(> x)fi n (> x) introduced earlier in equations (28), we can can write 

h{>x) = 4r 

vi{> x)h(> x) — h(> x)fj,i(> x) = (2o»o) 



f2(> x)h(> x) 



r[¥] 
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= ( '-"I 6 . + ^W 'ggj- m 

It is interesting to note the power of the bias 6(> x) that appears in each of these expressions is precisely that required that, 
when divided by the appropriate quantity to evaluate M s , the parameters for collapsed objects in the limit x « 1 all become 
independent of x: 



M 3 (x) 
M A {x) 



4 r[i + w]' 
16c.; 2 r[o;] 2 r[^] 



_ (i-^Ks + a,) r[i±^] 4 

M5W ~ 64^ IWT[2-w]- (39) 

The implications for cumulants and cumulant correlators of overdense regions are illustrated in Figure 3. 



4.2 The case of highly over-dense objects (x » 1) 



It is also possible to evaluate the Sn parameters in the limit when x » 1. The dominant contribution to this case comes 
from the singular nature of a(N c ) for small negative values of y. The value of y = y 3 for which a(y) becomes singular is given 
by the solution of the equation 

where t s can be evaluated by solving the implicit equation 

-m 

which really depends on the exact modelling of G(r) in the nonlinear regime. However, as we will show, in the limit of rare 
events where x takes very large values, the hierarchical amplitudes become independent of the initial conditions and this 
considerably simplifies things. The quantity a^y) plays a very important role in determining various scaling properties and is 
known to have a singularity for small but negative values of y which determines the asymptotic values of Sn for large values 
of x. We can express a{y) near the singularity as 

a-a s = -a s r(-^)(y-y s f 2 ; (42) 
with 

- = ^^ WG »w { 2G ' { X't ] ) • (43) 

Such a singularity produces an exponential cut-off in the power law profile of h(x) above, and which we can express as 

h(x) = -a a y a x~ 5/2 exp(y a x). (44) 

Differentiating y twice with respect to x and using the fact that the first order derivative of r with respect to y vanishes at 
the singular point y s it is possible to show that, near the singular point, 

(.-» = (^)"'(, -».)"'. m 

Similarly, the other /j, n (y) functions can also be expressed as a function of y near the singularity y s . We list only those terms 
which will dominate in determining the Sn parameters for large values of x. 

-yG"(r) G"{t s ) ( 2G'{t s )G"{t s ) \-^ 1/2 
= l + yG"(T) = G^W){ G>"(t 3 ) ) {y - ys) 



W(V ) = ~ yG "'^ G"(r s ) 2 ( 2G'(r s )G"(r s ) y^ 3/2 
^ Ki " (l + yG"(r)f G"(t s ) \ G"'(r s ) :i J {y ^ 

„ 4(y) = Z^M i 3y 2G '"(-f 3G,, (^) 3 ( 2G'(r s )G"(r s ) y^ 

^ Ki " " (l + yG"(r))^ (l + yG"(r)Y' G"'(t s ) 3 G'(t s ) \ G"'(t s ) J {y ys) ' { ' 

Using these relations in the expressions for ii n {> x), it is possible to show that, 

b(>x) = M>^-(^) 

^ >x) ^ (g^)) 2 ^ 

= (c^) 4 - (48) 
Using the appropriate definitions of the Sn and Cnm parameters, it is now straightforward to obtain 

S N = N N - 2 , C c N1 =N N -\ C C N11 =N N . (49) 

It is interesting that all the fi„(> x) become equal to unity in this limit, so that the Sn parameters are simply equal to 
the number of tree diagrams of that order, i.e. N N ~ 2 . Exactly the same result has also been derived from the extended Press- 
Schechter formalism (Press & Schechter 1974; Mo et al. 96). It is remarkable and encouraging that the hierarchical ansatz, 
which is known to be valid only in the highly non-linear regime, predicts the same results as this alternative formalism, which 
is based on smoothing of the initial density field. 



5 DISCUSSION 



Our starting point for this study has been the most general form of hierarchical model for the matter correlation functions 
in the highly non-linear regime. This is equivalent to the assumption that the correlation hierarchy can be built up as a tree 
structure where each vertex is assigned a different weight (the vertex amplitude), but which is independent of geometrical 
form factors. The amplitudes associated with vertices are left arbitrary. We have adopted the method of generating functions 
introduced by Bernardeau & Schaeffer (1992) to show that such a hierarchy induces a similar tree structure for overdense 
cells, which we take to represent collapsed objects. 

We then expanded the generating functions for the joint probability distributions of several cells in terms of a series in 
powers of At zeroth order it reproduces one-point PDF with no correlation between different cells. At linear order of 

two-point correlations between different cells can be used to compute the bias of overdense cells with respect to the 
underlying matter distribution. At second order of a hierarchical form for the three-point correlation of overdense cells 

is reproduced and this can be used to compute the hierarchical amplitude Mg associated with collapsed objects. Similarly, each 
higher-order term reproduces the amplitude of a new vertex and the associated multipoint cumulant correlators (MCCs) for 
collapsed objects. Our analysis is also able to reproduce the exact number of all the higher-order hybrid and snake diagrams 
with their associated tree amplitudes. We found that the one-point and the multi-point statistics of collapsed objects depend 
physically on properties intrinsic to the objects, and thus depend mathematically only on the scaling variable x = N/N c . 

Assuming a very general model for the generating function G(r), for the underlying mass we have computed cumulants 
S N and cumulant correlators C N1 for overdense cells. We have carried out this analysis in two limiting cases, of moderately 
overdense cells x « 1 and extremely overdense cells x » 1, respectively. We find that for extremely overdense cells the 
results are independent of initial conditions, the amplitudes associated with vertices of arbitrary order tend asymptotically 
to unity, and the generating functions can be expressed in a closed form G(r) = exp(— r). This closed-form solution can be 
used to compute all other statistical quantities for over-dense cells such as the count probability distribution function (CPDF) 
or the void probability function (VPF) for overdense cells. For moderately overdense cells, the results depend on the initial 
power spectrum and we find that the Sn parameters for overdense cells decrease with increasing small-scale power in this 
case. 

We have also developed a diagrammatic method for counting different ways in which a general correlation function £jv 
with N arguments can be grouped in such a way that p of its nodes are associated with one particular cell, q are in a second 
cell, and so on. In leading order such a grouping of vertices in different cells generates a new level of the tree hierarchy which 
can be related to the correlation hierarchy for overdense cells. The only difference between the trees for the underlying mass 
distribution and those for collapsed objects lies in the internal structure of their constituent nodes. Trees associated with 
collapsed objects have internal degrees of freedom for different vertices that appear in different ways. As we have shown, 
however, it is possible to group nodes associated with underlying mass trees in such a way as to replicate the tree structure 
for collapsed objects with same external topology but different internal weights. Our method of counting internal degrees of 
freedom associated with the vertices matches with analytical results obtained by more laborious means. 

In an exactly similar manner it is possible to use the trees associated with collapsed objects to study how clusters or 
groups of collapsed objects are correlated with each other. In this case, the nodes associated with groups of collapsed objects 
will have many more internal degrees of freedom as they will consist of nodes of tree structures of collapsed objects which 
themselves are made of point-like vertices of the underlying mass distribution. Amplitudes associated with nodes appearing 
in the tree structure of groups of collapsed objects will definitely depend on the mass of objects that constitute the group, 
and will have scaling properties associated with them. We shall present a complete analysis of this problem in near future. 

/.From an observational point of view, the results contained in this paper are important for estimating finite volume 
corrections. It is well known that the finite size of galaxy catalogs introduce errors in estimations of cumulants and cumulant 
correlators. Error estimation for values of the Sn parameters - normalised moments of the CPDF - extracted from finite 
catalogs involve volume integrals of the two-point joint probability distribution function. Such an analysis, carried out by 
Szapudi & Colombi (1996), involves the generating function Hi(y) = r(y). Similar estimation procedures for multi-point 
cumulant correlators will involve higher-order generating functions describing correlation between more than two cells. Our 
analytical expressions for /in will be useful in such analyses. In a slightly different approach, it is also possible to talk about the 
probability distribution in the context of error estimation. Estimated values of different statistical quantities are affected by the 
presence or absence of rare objects in a finite size catalog. The higher the order of cumulant, the higher the contribution from 
the high-iV tail of the CPDF, which is highly sensitive to presence or absence of rare objects. It is natural, therefore, to expect 
that our analytical results determining the generating function for rare objects can be related to the probability distribution 
of estimated errors. This will extend earlier studies in this direction where only the variance of errors were considered. This 
is important because we know that finite volume correction not only introduces large error but measurements are more likely 
to underestimate the true values of cumulants or cumulant correlators. 

Cumulants for collapsed objects have also been computed using alternative procedures, chiefly by extension of the Press- 
Schechter formalism (Bond et al. 1991; Bower 1992). Our results complement these studies, because they are based on a fully 
nonlinear approach and depend only on the assumption that there is a hierarchical structure of the correlation functions. 
We think this ansatz is based in relatively firm ground, and can, at least in principle, be directly related to the microscopic 
physics of gravitational clustering through the BBGKY equations. The analysis can also be applied the MCCs of arbitrary 
order which have not yet been investigated using any other formalism. Our results for computation of Sn parameters for 
collapsed objects match with such an analysis done using extended Press-Schechter formalism. However similar comparison 



for moderately over-dense objects are difficult as different prescriptions are used to define collapsed objects in these two 
formalisms. We plan to compare our theoretical predictions against numerical simulations both in 3D as well as in 2D, and 
will present these results in due course. 

It is also important to note that, owing to our incomplete knowledge of gravitational clustering in the highly non-linear 
regime, at least two different ansatze are often used to study nonlinear clustering, namely the hierarchical ansatz (which 
we discuss here) and the stable clustering ansatz and these should not be confused. While the hierarchical ansatz relates 
higher-order correlation functions with the two-point correlation function, the stable clustering ansatz on the other hand 
relates the power-law index of the two-point correlation with the initial power spectral index. Thus these two ansatze are not 
contradictory but complement each other. Our analysis is based completely on the hierarchical ansatz, and we have never 
used stable clustering to derive our results. 

It may also be possible to extend our analysis to the joint probability distribution function of densities of concentric cells, 
with their centres coinciding with the centre of a dark halo. In such a context it will be interesting to check if our analysis can 
also predict the inner structure of haloes , i.e. their density profiles. This will also clarify the issue of whether there exists a 
universal halo profile as proposed by Navarro et al. (1996, 1997) and how it relates to stable clustering and to the hierarchical 
ansatz. We will report these results elsewhere. 
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